Abstract

The ABCD matrix is one of the essential mathematical instruments in optics.
It is the two-by-two representation of the group Sp(2), which is applicable to
many branches of physics, including squeezed states of light, special relativity
and coupled oscillators. It is pointed out that the shear representation is
oriented to binary logic, which may be friendly to computer applications. While
this is a future possibility, it is known that para-axial lens optics is based on
the shear representation of the Sp(2) group. It is pointed out that the most
general form of the ABCD matrix can be written in terms of six shear
matrices, which correspond to lens and translation matrices. The parameter for
each shear matrix is computed in terms of the three independent parameters
of the ABCD matrix.
I. INTRODUCTION

In a recent series of papers [1,2], Han et al. studied possible optical devices capable of
performing the matrix operations of the following types:

\[
T = \begin{pmatrix} 1 & a \\ 0 & 1 \end{pmatrix}, \qquad
L = \begin{pmatrix} 1 & 0 \\ b & 1 \end{pmatrix}. \tag{1}
\]

Since these matrices perform shear transformations in a two-dimensional space [3], we shall
call them "shear" matrices.

However, Han et al. were interested in computer applications of these shear matrices 
because they can convert multiplications into additions. Indeed, the T matrix has the 
property: 

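\[
T_1 T_2 = \begin{pmatrix} 1 & a_1 \\ 0 & 1 \end{pmatrix}
\begin{pmatrix} 1 & a_2 \\ 0 & 1 \end{pmatrix}
= \begin{pmatrix} 1 & a_1 + a_2 \\ 0 & 1 \end{pmatrix}, \tag{2}
\]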

and the L matrix has a similar "slide-rule" property. This property is valid only if we restrict 
computations to the T-type matrices or to the L-type matrices. 
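
The "slide-rule" property can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the helper names `T` and `L` and the sample parameter values are illustrative choices, not taken from the papers cited above.

```python
import numpy as np

def T(a):
    """Translation-type (upper-triangular) shear matrix with parameter a."""
    return np.array([[1.0, a], [0.0, 1.0]])

def L(b):
    """Lens-type (lower-triangular) shear matrix with parameter b."""
    return np.array([[1.0, 0.0], [b, 1.0]])

a1, a2 = 0.7, -2.5

# Within one type, multiplying matrices just adds the parameters.
same_type = T(a1) @ T(a2)
print(np.allclose(same_type, T(a1 + a2)))
print(np.allclose(L(a1) @ L(a2), L(a1 + a2)))

# Mixing the two types gives a matrix that is no longer a single shear,
# although its determinant is still one.
mixed = T(a1) @ L(a2)
print(mixed, np.linalg.det(mixed))
```

The mixed product has a diagonal element different from one, which is why a chain containing both types cannot be reduced to a single addition.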

What happens if we use both L and T types? Then it will lead to a binary logic. In the
present paper, we study this binary property of the ABCD matrix, which takes the form

\[
G = \begin{pmatrix} A & B \\ C & D \end{pmatrix}, \tag{3}
\]

where the elements A, B, C and D are real numbers satisfying AD - BC = 1. Because of
this condition, there are three independent parameters.

We are interested in constructing the most general form of the ABCD matrix in terms of
the two shear matrices given in Eq. (1). Two-by-two matrices with the above property form
the symplectic group Sp(2). Indeed, we are quite familiar with the conventional two-by-two
representation of the Sp(2) group. This group is isomorphic to SU(1,1), which is the basic
scientific language for squeezed states of light [4]. This group
is also applicable to other branches of optics, including polarization optics, interferometers,
layer optics [5], and para-axial optics [6,7]. The Sp(2) symmetry can be found in many other
branches of physics, including canonical transformations [3], special relativity [4], Wigner
functions [4], and coupled harmonic oscillators [8].

Even though this group covers a wide spectrum of physics, the mathematical content 
of the present paper is minimal because we are dealing only with three real numbers. We 
use group theoretical theorems in order to manage our calculations in a judicious manner. 
Specifically, we use group theory to represent the most general form of the ABCD matrix in
terms of the shear matrices given in Eq. (1), and to translate the group theoretical language
into a computer-friendly binary logic.

With this point in mind, we propose to write the two-by-two ABCD matrices in the
form

\[
T L T L T \cdots. \tag{4}
\]

Since each matrix in this chain contains one parameter, there are N parameters for N matrices
in the chain. On the other hand, since both T and L are real unimodular matrices, the final
expression is also real unimodular. This means that the expression contains only three
independent parameters.

Then we are led to the question of whether there is a shortest chain which can accommodate
the most general form of the two-by-two matrices. We shall conclude in this paper that six 
matrices are needed for the most general form, with three independent parameters. While 
we had in mind possible future computer applications of this binary logic, we are not the 
first ones to study this problem from the point of view of ray optics. 

Indeed, in 1985, Sudarshan et al. raised essentially the same question in connection with
para-axial lens optics [7]. They observed that the lens and translation matrices are in the
form of the matrices given in Eq. (1). In fact, the notations L and T for the shear matrices of
Eq. (1) are derived from the words "lens" and "translation" respectively in para-axial lens
optics. Sudarshan et al. concluded that three lenses are needed for the most general form
of the two-by-two matrices of the symplectic group. Of course, their lens matrices are
appropriately separated by translation matrices. However, Sudarshan et al. stated in their
paper that the calculation of each lens or translation parameter is "tedious".

In the present paper, we make this calculation less tedious by using a decomposition of
the ABCD matrix derivable from Bargmann's paper [9]. As far as the number of lenses
is concerned, we reach the same conclusion as that of Sudarshan et al. In addition, we
complete the calculation of the lens parameter for each lens and the translation parameter for
each translation matrix, in terms of the three independent parameters of the ABCD matrix.

In Sec. II, it is noted that the Sp(2) matrices can be constructed from two different sets of
generators. We call one of them the squeeze representation, and the other the shear
representation. In Sec. III, it is shown that the most general form of the Sp(2) matrices or
ABCD matrices can be decomposed into one symmetric matrix and one orthogonal matrix. It is
shown that the symmetric matrix can be decomposed into four shear matrices and the orthogonal
matrix into three. In Sec. IV, we discuss para-axial lens optics from the traditional point
of view, and present a new result in this well-established subject. In Sec. V, we
discuss other areas of optical sciences where the binary representation of the group Sp(2)
may serve useful purposes. We also discuss a possible extension of the ABCD matrix to a
complex representation, which will enlarge the group Sp(2) to a larger group.

II. SQUEEZE AND SHEAR REPRESENTATIONS OF THE SP(2) GROUP 

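Since the ABCD matrix is a representation of the group Sp(2), we borrow mathematical
tools from this group. This group is generated by

\[
L = \frac{1}{2}\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad
B_1 = \frac{1}{2}\begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix}, \qquad
B_2 = \frac{1}{2}\begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix}, \tag{5}
\]

when they are applied to a two-dimensional xy space. The L matrix generates rotations
around the origin, while $B_1$ and $B_2$ generate squeezes along the xy axes and along the axes
rotated by 45° respectively. This aspect of Sp(2) is well known. Let us consider a different
representation.
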
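The shear matrices of Eq. (1) can be written as

\[
\begin{pmatrix} 1 & s \\ 0 & 1 \end{pmatrix} = \exp(-i s X_1), \qquad
\begin{pmatrix} 1 & 0 \\ u & 1 \end{pmatrix} = \exp(-i u X_2), \tag{6}
\]

with

\[
X_1 = \begin{pmatrix} 0 & i \\ 0 & 0 \end{pmatrix}, \qquad
X_2 = \begin{pmatrix} 0 & 0 \\ i & 0 \end{pmatrix}, \tag{7}
\]

which serve as the generators. If we introduce a third matrix

\[
X_3 = \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix}, \tag{8}
\]

it generates squeeze transformations:

\[
\exp(-i \eta X_3) = \begin{pmatrix} e^{\eta} & 0 \\ 0 & e^{-\eta} \end{pmatrix}. \tag{9}
\]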

The matrices $X_1$, $X_2$, and $X_3$ form the following closed set of commutation relations:

\[
[X_1, X_2] = i X_3, \qquad [X_1, X_3] = -2i X_1, \qquad [X_2, X_3] = 2i X_2. \tag{10}
\]

As we noted in Eq. (6), the matrices $X_1$ and $X_2$ generate shear transformations [3,10,11].
The matrix $X_3$ generates squeeze transformations. What, then, is the group generated by one
squeeze and two shear transformations?
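
The commutation relations and the exponentiated forms can be verified numerically. The sketch below assumes the generators $X_1$, $X_2$, $X_3$ have the explicit two-by-two forms implied by Eq. (6) and Eq. (9); the truncated power series `expm_series` is a hypothetical helper used only to keep the example dependent on NumPy alone.

```python
import numpy as np

# Generators in the form implied by Eq. (6) and Eq. (9) (assumed here).
X1 = np.array([[0, 1j], [0, 0]])
X2 = np.array([[0, 0], [1j, 0]])
X3 = np.array([[1j, 0], [0, -1j]])

def comm(P, Q):
    """Matrix commutator [P, Q]."""
    return P @ Q - Q @ P

def expm_series(M, terms=40):
    """Truncated power series for exp(M); adequate for small 2x2 inputs."""
    result = np.eye(2, dtype=complex)
    term = np.eye(2, dtype=complex)
    for n in range(1, terms):
        term = term @ M / n
        result = result + term
    return result

s, u, eta = 0.3, -1.2, 0.5
T_s = expm_series(-1j * s * X1)      # upper-triangular shear
L_u = expm_series(-1j * u * X2)      # lower-triangular shear
Sq = expm_series(-1j * eta * X3)     # squeeze along the axes

print(np.allclose(comm(X1, X2), 1j * X3))
print(np.allclose(comm(X1, X3), -2j * X1))
print(np.allclose(comm(X2, X3), 2j * X2))
print(T_s.real, L_u.real, Sq.real, sep="\n")
```

Because $X_1$ and $X_2$ are nilpotent, the series for the two shears terminates after the linear term, which is another way of seeing why shear parameters simply add.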

The generators of Eq. (7) and Eq. (8) can be written as

\[
X_1 = B_2 - L, \qquad X_2 = B_2 + L, \qquad X_3 = 2 B_1, \tag{11}
\]

where $L$, $B_1$ and $B_2$ are given in Eq. (5). The Sp(2) group can now be generated by two
seemingly different sets of generators, namely the squeeze-rotation generators of Eq. (5) and
the shear-squeeze generators of Eq. (11). We call the representations generated by them the
"squeeze" and "shear" representations respectively. It is quite clear that one representation
can be transformed into the other at the level of generators. Our experience in the
conventional squeeze representation tells us that an arbitrary Sp(2) matrix can be decomposed
into squeeze and rotation matrices. Likewise, we should be able to decompose the arbitrary
matrix into shear and squeeze matrices.

We are quite familiar with Sp(2) matrices generated by the matrices given in Eq. (5). As
shown in Appendix A, the most general form can be written as

\[
G = \begin{pmatrix} \cos\phi & -\sin\phi \\ \sin\phi & \cos\phi \end{pmatrix}
\begin{pmatrix} e^{\eta} & 0 \\ 0 & e^{-\eta} \end{pmatrix}
\begin{pmatrix} \cos\lambda & -\sin\lambda \\ \sin\lambda & \cos\lambda \end{pmatrix}, \tag{12}
\]

where the three free parameters are $\phi$, $\eta$ and $\lambda$. The real numbers A, B, C and D
in Eq. (3) can be written in terms of these three parameters. Conversely, the parameters
$\phi$, $\eta$ and $\lambda$ can be written in terms of A, B, C and D with the condition that
AD - BC = 1. This matrix is of course written in terms of squeeze and rotation matrices.
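
As a concrete illustration of the rotation-squeeze-rotation form, the sketch below builds G from three sample parameter values and confirms that it is real and unimodular. The function names and the numerical values are illustrative assumptions.

```python
import numpy as np

def rot(angle):
    """Rotation matrix [[cos, -sin], [sin, cos]]."""
    c, s = np.cos(angle), np.sin(angle)
    return np.array([[c, -s], [s, c]])

def squeeze(eta):
    """Squeeze matrix diag(e^eta, e^-eta)."""
    return np.diag([np.exp(eta), np.exp(-eta)])

def abcd_from_angles(phi, eta, lam):
    """ABCD matrix as rotation(phi) * squeeze(eta) * rotation(lam)."""
    return rot(phi) @ squeeze(eta) @ rot(lam)

phi, eta, lam = 0.4, 0.8, -1.1
G = abcd_from_angles(phi, eta, lam)
A, D = G[0, 0], G[1, 1]

print(G)
print(np.linalg.det(G))   # unimodular up to rounding
```

The diagonal elements follow directly from the product: A = cosh(eta) cos(phi + lam) + sinh(eta) cos(phi - lam), and D differs only in the sign of the second term.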

Our next question is whether it is possible to write the same matrix in the shear
representation. In the shear representation, the components should be in the form of the T
and L matrices given in Eq. (1) and a squeeze matrix of the form

\[
\begin{pmatrix} e^{\eta} & 0 \\ 0 & e^{-\eta} \end{pmatrix}, \tag{13}
\]

because they are generated by the matrices given in Eq. (7) and Eq. (8). But this
mathematical problem is not our main concern. In the present paper, we are interested in
whether it is possible to decompose the ABCD matrix into shear matrices.



III. DECOMPOSITIONS AND RECOMPOSITIONS 

In this paper, we are interested in writing the most general form of the matrix G of Eq. (3)
as a chain of the shear matrices. Indeed, Sudarshan et al. attempted this problem in
connection with para-axial lens optics. Their approach is of course correct. They stated in
their paper, however, that the complete calculation is "tedious".

We propose to complete this well-defined calculation by decomposing the matrix G into
one symmetric matrix and one orthogonal matrix. For this purpose, let us write the last
matrix of Eq. (12) as

\[
\begin{pmatrix} \cos\phi & \sin\phi \\ -\sin\phi & \cos\phi \end{pmatrix}
\begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}, \tag{14}
\]

with $\lambda = \theta - \phi$. Instead of $\lambda$, $\theta$ becomes an independent parameter.

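The matrix G can now be written as a product of two matrices, one symmetric and the other
orthogonal:

\[
G = S R, \tag{15}
\]

with

\[
R = \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}. \tag{16}
\]

The symmetric matrix S takes the form

\[
S = \begin{pmatrix}
\cosh\eta + (\sinh\eta)\cos(2\phi) & (\sinh\eta)\sin(2\phi) \\
(\sinh\eta)\sin(2\phi) & \cosh\eta - (\sinh\eta)\cos(2\phi)
\end{pmatrix}. \tag{17}
\]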

Our procedure is to write S and R separately as shear chains. Let us consider first the 
rotation matrix. 

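In terms of the shears, the rotation matrix R can be written as [10]:

\[
R = \begin{pmatrix} 1 & -\tan(\theta/2) \\ 0 & 1 \end{pmatrix}
\begin{pmatrix} 1 & 0 \\ \sin\theta & 1 \end{pmatrix}
\begin{pmatrix} 1 & -\tan(\theta/2) \\ 0 & 1 \end{pmatrix}. \tag{18}
\]

This expression is in the form of TLT, but it can also be written in the form of LTL. If we
take the transpose and change the sign of $\theta$, R becomes

\[
R' = \begin{pmatrix} 1 & 0 \\ \tan(\theta/2) & 1 \end{pmatrix}
\begin{pmatrix} 1 & -\sin\theta \\ 0 & 1 \end{pmatrix}
\begin{pmatrix} 1 & 0 \\ \tan(\theta/2) & 1 \end{pmatrix}. \tag{19}
\]

Both R and R' are the same matrix but are decomposed in different ways.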
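
The two three-shear forms of the rotation can be compared numerically. This is a minimal sketch assuming NumPy; the helpers `T`, `L` and `rot` are illustrative names, and the angle must stay away from odd multiples of pi, where tan(theta/2) diverges.

```python
import numpy as np

def T(a):
    """Upper-triangular shear."""
    return np.array([[1.0, a], [0.0, 1.0]])

def L(b):
    """Lower-triangular shear."""
    return np.array([[1.0, 0.0], [b, 1.0]])

def rot(theta):
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])

theta = 0.9
t = np.tan(theta / 2)

R_tlt = T(-t) @ L(np.sin(theta)) @ T(-t)    # TLT form
R_ltl = L(t) @ T(-np.sin(theta)) @ L(t)     # LTL form

print(np.allclose(R_tlt, rot(theta)))
print(np.allclose(R_ltl, rot(theta)))
```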

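As for the two-parameter symmetric matrix of Eq. (17), we start with a symmetric LTLT
form

\[
S = \begin{pmatrix} 1 & 0 \\ b & 1 \end{pmatrix}
\begin{pmatrix} 1 & a \\ 0 & 1 \end{pmatrix}
\begin{pmatrix} 1 & 0 \\ a & 1 \end{pmatrix}
\begin{pmatrix} 1 & b \\ 0 & 1 \end{pmatrix}, \tag{20}
\]

which can be combined into one symmetric matrix:

\[
S = \begin{pmatrix}
1 + a^2 & b(1 + a^2) + a \\
b(1 + a^2) + a & 1 + 2ab + b^2(1 + a^2)
\end{pmatrix}. \tag{21}
\]

By comparing Eq. (17) and Eq. (21), we can compute the parameters a and b in terms of
$\eta$ and $\phi$. The result is

\[
a = \pm\sqrt{(\cosh\eta - 1) + (\sinh\eta)\cos(2\phi)}, \qquad
b = \frac{(\sinh\eta)\sin(2\phi) \mp \sqrt{(\cosh\eta - 1) + (\sinh\eta)\cos(2\phi)}}
{\cosh\eta + (\sinh\eta)\cos(2\phi)}. \tag{22}
\]
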
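This matrix can also be written in a TLTL form:

\[
S' = \begin{pmatrix} 1 & b' \\ 0 & 1 \end{pmatrix}
\begin{pmatrix} 1 & 0 \\ a' & 1 \end{pmatrix}
\begin{pmatrix} 1 & a' \\ 0 & 1 \end{pmatrix}
\begin{pmatrix} 1 & 0 \\ b' & 1 \end{pmatrix}. \tag{23}
\]

Then the parameters a' and b' are

\[
a' = \pm\sqrt{(\cosh\eta - 1) - (\sinh\eta)\cos(2\phi)}, \qquad
b' = \frac{(\sinh\eta)\sin(2\phi) \mp \sqrt{(\cosh\eta - 1) - (\sinh\eta)\cos(2\phi)}}
{\cosh\eta - (\sinh\eta)\cos(2\phi)}. \tag{24}
\]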

The difference between the two sets of parameters ab and a'b' is the sign of the parameter
$\eta$. This sign change means that the squeeze operation is in the direction perpendicular to
the original direction. In choosing ab or a'b', we also have to make sure that
the quantity inside the square root is not negative. If $\cos(2\phi)$ is sufficiently small, both
sets are acceptable. On the other hand, if the absolute value of $(\sinh\eta)\cos(2\phi)$ is
greater than $(\cosh\eta - 1)$, only one of the sets, ab or a'b', is valid.
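
The validity condition on the square root can be made explicit in a short numerical sketch. It assumes the upper sign for the square roots, uses illustrative function names, and returns `None` when the corresponding set of parameters is not real.

```python
import numpy as np

def T(a):
    return np.array([[1.0, a], [0.0, 1.0]])

def L(b):
    return np.array([[1.0, 0.0], [b, 1.0]])

def sym(eta, phi):
    """Symmetric matrix S in terms of eta and phi."""
    ch, sh = np.cosh(eta), np.sinh(eta)
    c2, s2 = np.cos(2 * phi), np.sin(2 * phi)
    return np.array([[ch + sh * c2, sh * s2], [sh * s2, ch - sh * c2]])

def ltlt_params(eta, phi):
    """(a, b) for the LTLT chain, or None if the square root is not real."""
    S = sym(eta, phi)
    x = S[0, 0] - 1.0
    if x < 0:
        return None
    a = np.sqrt(x)
    return a, (S[0, 1] - a) / S[0, 0]

def tltl_params(eta, phi):
    """(a', b') for the TLTL chain, or None if the square root is not real."""
    S = sym(eta, phi)
    x = S[1, 1] - 1.0
    if x < 0:
        return None
    a = np.sqrt(x)
    return a, (S[0, 1] - a) / S[1, 1]

# Case 1: only the ab set is valid.
print(ltlt_params(0.6, 0.3), tltl_params(0.6, 0.3))

# Case 2: cos(2 phi) vanishes, so both sets are valid.
eta, phi = 1.0, np.pi / 4
a, b = ltlt_params(eta, phi)
ap, bp = tltl_params(eta, phi)
S_ltlt = L(b) @ T(a) @ L(a) @ T(b)
S_tltl = T(bp) @ L(ap) @ T(ap) @ L(bp)
print(np.allclose(S_ltlt, sym(eta, phi)), np.allclose(S_tltl, sym(eta, phi)))
```

Since S is symmetric with unit determinant, its two diagonal elements cannot both be smaller than one, so at least one of the two sets is always available.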

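We can now combine the S and R matrices in order to construct the ABCD matrix. In
so doing, we can reduce the number of matrices by one:

\[
S R = \begin{pmatrix} 1 & 0 \\ b & 1 \end{pmatrix}
\begin{pmatrix} 1 & a \\ 0 & 1 \end{pmatrix}
\begin{pmatrix} 1 & 0 \\ a & 1 \end{pmatrix}
\begin{pmatrix} 1 & b - \tan(\theta/2) \\ 0 & 1 \end{pmatrix}
\begin{pmatrix} 1 & 0 \\ \sin\theta & 1 \end{pmatrix}
\begin{pmatrix} 1 & -\tan(\theta/2) \\ 0 & 1 \end{pmatrix}. \tag{25}
\]

We can also form the product S'R'. The result is

\[
S' R' = \begin{pmatrix} 1 & b' \\ 0 & 1 \end{pmatrix}
\begin{pmatrix} 1 & 0 \\ a' & 1 \end{pmatrix}
\begin{pmatrix} 1 & a' \\ 0 & 1 \end{pmatrix}
\begin{pmatrix} 1 & 0 \\ b' + \tan(\theta/2) & 1 \end{pmatrix}
\begin{pmatrix} 1 & -\sin\theta \\ 0 & 1 \end{pmatrix}
\begin{pmatrix} 1 & 0 \\ \tan(\theta/2) & 1 \end{pmatrix}. \tag{26}
\]

For the combination SR of Eq. (25), two adjoining T matrices were combined into one T
matrix. Similarly, two L matrices were combined into one for the S'R' combination of
Eq. (26).

In both cases, there are six matrices, consisting of three T and three L matrices. This
is indeed the minimum number of shear matrices needed for the most general form of the
ABCD matrix with three independent parameters.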
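
The whole procedure can be sketched as a small routine that takes a real unimodular matrix and returns six shear matrices. This is only one possible implementation under stated assumptions: the symmetric and orthogonal factors are obtained numerically from a polar decomposition rather than from closed-form expressions, the positive square root is chosen, the rotation angle is assumed not to be an odd multiple of pi, and all function names are hypothetical.

```python
from functools import reduce
import numpy as np

def T(a):
    return np.array([[1.0, a], [0.0, 1.0]])

def L(b):
    return np.array([[1.0, 0.0], [b, 1.0]])

def polar_SR(G):
    """Split G = S R with S symmetric positive definite and R a rotation."""
    w, V = np.linalg.eigh(G @ G.T)
    S = V @ np.diag(np.sqrt(w)) @ V.T
    R = np.linalg.solve(S, G)
    return S, R

def shear_chain(G):
    """Six shear matrices whose ordered product reproduces G (det G = 1)."""
    S, R = polar_SR(G)
    theta = np.arctan2(R[1, 0], R[0, 0])
    t = np.tan(theta / 2)            # diverges if theta is an odd multiple of pi
    if S[0, 0] >= 1.0:
        # LTLT form of S followed by the TLT form of R, merging two T's.
        a = np.sqrt(S[0, 0] - 1.0)
        b = (S[0, 1] - a) / S[0, 0]
        return [L(b), T(a), L(a), T(b - t), L(np.sin(theta)), T(-t)]
    # Otherwise S[1, 1] >= 1: TLTL form of S followed by the LTL form of R.
    a = np.sqrt(S[1, 1] - 1.0)
    b = (S[0, 1] - a) / S[1, 1]
    return [T(b), L(a), T(a), L(b + t), T(-np.sin(theta)), L(t)]

def chain_product(chain):
    return reduce(np.matmul, chain)

G = np.array([[2.0, 3.0], [1.0, 2.0]])   # AD - BC = 1
chain = shear_chain(G)
print(len(chain), np.allclose(chain_product(chain), G))
```

The branch on the first diagonal element of S mirrors the choice between the ab and a'b' sets: whichever square root is real decides whether the chain starts with an L or a T matrix.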



IV. PARA-AXIAL LENS OPTICS 

So far, we have been investigating the possibilities of representing the ABCD matrices 
in terms of the two shear matrices. It is an interesting proposition because this binary 
representation could lead to a computer algorithm for computing the ABCD matrix in 
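optics as well as in other areas of physics. Indeed, this ABCD matrix has a deep root in
ray optics [6].

In para-axial lens optics, the lens and translation matrices take the form

\[
L = \begin{pmatrix} 1 & 0 \\ -1/f & 1 \end{pmatrix}, \qquad
T = \begin{pmatrix} 1 & z \\ 0 & 1 \end{pmatrix}, \tag{27}
\]

respectively, where f is the focal length of the lens and z is the translation distance.
Indeed, in the Introduction, this was what we had in mind when we defined
the shear matrices of L and T types. These matrices are applicable to the two-dimensional
space of

\[
\begin{pmatrix} y \\ m \end{pmatrix}, \tag{28}
\]

where y measures the height of the ray, while m is the slope of the ray.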

The one-lens system consists of a TLT chain. The two-lens system can be written as
TLTLT. If we add more lenses, the chain becomes longer. However, the net result is one
ABCD matrix with three independent parameters. In Sec. I, we asked the question of
how many L and T matrices are needed to represent the most general form of the ABCD
matrix. Our conclusion was that six matrices, with three lens matrices, are needed. The
chain can be either LTLTLT or TLTLTL. In either case, three lenses are required. This
conclusion was obtained earlier by Sudarshan et al. in 1985 [7]. In this paper, using the
decomposition technique derived from the Bargmann decomposition, we were able to compute
the parameter of each shear matrix in terms of the three parameters of the ABCD matrix.
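
A one-lens TLT chain gives a familiar example of how the shear parameters determine the ABCD elements. The sketch below assumes the standard thin-lens convention, in which the lens matrix has lower-left element -1/f and the ray vector is (height, slope); the function name `one_lens` and the numerical values are illustrative.

```python
import numpy as np

def T(z):
    """Translation by distance z."""
    return np.array([[1.0, z], [0.0, 1.0]])

def L(b):
    """Lens-type shear; a thin lens of focal length f has b = -1/f."""
    return np.array([[1.0, 0.0], [b, 1.0]])

def one_lens(d1, f, d2):
    """Translate by d1, pass a lens of focal length f, translate by d2."""
    return T(d2) @ L(-1.0 / f) @ T(d1)

f, d1 = 10.0, 15.0
d2 = 1.0 / (1.0 / f - 1.0 / d1)   # thin-lens imaging condition
M = one_lens(d1, f, d2)

print(d2)
print(M)   # B element vanishes at the image plane; A is the magnification
```

With B = 0 the output height no longer depends on the input slope, which is the matrix statement of image formation.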

In para-axial optics, we often encounter special forms of the ABCD matrix. For instance,
the matrix of the form of Eq. (13) is for pure magnification [12]. This is a special case of the
decomposition given for S and S' in Eq. (20) and Eq. (23) respectively, with $\phi = 0$. However,
if $\eta$ is positive, the set a'b' is not acceptable because the quantity in the square root in
Eq. (24) becomes negative. For the ab set,

\[
a = \pm (e^{\eta} - 1)^{1/2}, \qquad b = \mp e^{-\eta} (e^{\eta} - 1)^{1/2}. \tag{29}
\]

The decomposition of the LTLT type is given in Eq. (20).
We often encounter the triangular matrices of the form [13]

\[
\begin{pmatrix} A & B \\ 0 & D \end{pmatrix} \quad \text{or} \quad
\begin{pmatrix} A & 0 \\ C & D \end{pmatrix}. \tag{30}
\]

However, from the condition that their determinant be one, these matrices take the form

\[
\begin{pmatrix} e^{\eta} & B \\ 0 & e^{-\eta} \end{pmatrix} \quad \text{or} \quad
\begin{pmatrix} e^{\eta} & 0 \\ C & e^{-\eta} \end{pmatrix}. \tag{31}
\]

The first and second matrices are used for focal and telescope conditions respectively. We
call them the matrices of B and C types respectively. The question then is how many shear
matrices are needed to represent the most general form of these matrices. The triangular
matrix of Eq. (30) is discussed frequently in the literature. In the present paper, we
are interested in using only shear matrices as elements of decomposition.
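Let us consider the B type. It can be constructed either in the form

\[
\begin{pmatrix} e^{\eta} & B \\ 0 & e^{-\eta} \end{pmatrix}
= \begin{pmatrix} e^{\eta} & 0 \\ 0 & e^{-\eta} \end{pmatrix}
\begin{pmatrix} 1 & e^{-\eta} B \\ 0 & 1 \end{pmatrix} \tag{32}
\]

or

\[
\begin{pmatrix} e^{\eta} & B \\ 0 & e^{-\eta} \end{pmatrix}
= \begin{pmatrix} 1 & e^{\eta} B \\ 0 & 1 \end{pmatrix}
\begin{pmatrix} e^{\eta} & 0 \\ 0 & e^{-\eta} \end{pmatrix}. \tag{33}
\]

The squeeze matrix is itself a chain of four shear matrices. If the additional T matrix
adjoins a T matrix of that chain, the two are combined into one; otherwise it adds a fifth
matrix. The number of matrices in the chain can therefore be either four or five. We can
reach a similar conclusion for the matrix of the C type.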
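
The four-matrix case for the B type can be illustrated as follows. The sketch assumes a non-negative squeeze parameter, so that the LTLT form of the squeeze matrix is real, and takes the positive square root; `b_type_chain` is a hypothetical helper name.

```python
from functools import reduce
import numpy as np

def T(a):
    return np.array([[1.0, a], [0.0, 1.0]])

def L(b):
    return np.array([[1.0, 0.0], [b, 1.0]])

def b_type_chain(eta, B):
    """Four shears whose product is [[e^eta, B], [0, e^-eta]], for eta >= 0."""
    if eta < 0:
        raise ValueError("this sketch covers only eta >= 0")
    a = np.sqrt(np.exp(eta) - 1.0)
    b = -np.exp(-eta) * a
    # L(b) T(a) L(a) T(b) is the squeeze matrix; the trailing T absorbs
    # the extra translation T(e^-eta * B).
    return [L(b), T(a), L(a), T(b + np.exp(-eta) * B)]

eta, B = 0.5, 3.0
product = reduce(np.matmul, b_type_chain(eta, B))
print(product)
```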



V. OTHER AREAS OF OPTICAL SCIENCES 



We use the ABCD matrix as the ray transfer matrix [12]. There are many ray transfers
in optics other than para-axial lens optics. For instance, a laser resonator with spherical
mirrors is exactly like para-axial lens optics if the radius of the mirror is sufficiently large [14].
If wave fronts with phase are taken into account, or for Gaussian beams, the elements of the
ABCD matrix become complex [15,16]. In this case, the matrix operation can sometimes
be written as

\[
w' = \frac{A w + B}{C w + D}, \tag{34}
\]

where w is a complex number with two real parameters. This is precisely the bilinear
representation of the six-parameter Lorentz group. This bilinear representation was
discussed in detail for polarization optics by Han et al. [17]. This form of representation is
useful also in laser mode-locking and optical pulse transmission [16].



The bilinear form of Eq. (34) is equivalent to the matrix transformation [17]

\[
\begin{pmatrix} v_1' \\ v_2' \end{pmatrix}
= \begin{pmatrix} A & B \\ C & D \end{pmatrix}
\begin{pmatrix} v_1 \\ v_2 \end{pmatrix}, \tag{35}
\]

with

\[
w = \frac{v_1}{v_2}. \tag{36}
\]

This bilinear representation deals only with the ratio of the first component to the second in
the column vector to which the ABCD matrix is applicable. In polarization optics, for instance,
$v_1$ and $v_2$ correspond to the two orthogonal elements of polarization.
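
Which ratio of the two components is taken fixes the form of the bilinear map. The following sketch, with an arbitrarily chosen complex matrix and vector, checks both conventions numerically; the variable names are illustrative.

```python
import numpy as np

G = np.array([[1.0 + 1.0j, 2.0], [0.5j, 1.0]])   # arbitrary complex ABCD
A, B = G[0]
C, D = G[1]

v = np.array([0.3 - 0.2j, 1.1 + 0.4j])
v_new = G @ v

# Convention 1: w = v1 / v2 transforms as (A w + B) / (C w + D).
w = v[0] / v[1]
w_new = v_new[0] / v_new[1]
print(np.isclose(w_new, (A * w + B) / (C * w + D)))

# Convention 2: u = v2 / v1 transforms as (C + D u) / (A + B u).
u = v[1] / v[0]
u_new = v_new[1] / v_new[0]
print(np.isclose(u_new, (C + D * u) / (A + B * u)))
```

In either convention an overall complex factor of the column vector drops out, which is why the bilinear form carries only the ratio.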

Indeed, this six-parameter group can accommodate a wide spectrum of optics and other
sciences. Recently, the two-by-two Jones matrix and four-by-four Mueller matrix have been
shown to be two-by-two and four-by-four representations of the Lorentz group [1]. Also
recently, Monzon and Sanchez showed that multilayer optics could serve as an analog computer
for special relativity [5]. More recently, it was shown that two-beam interferometers can also
be formulated in terms of the Lorentz group [18].



CONCLUDING REMARKS 

The Lorentz group was introduced to physics as a mathematical device to deal with
Lorentz transformations in special relativity. However, this group is becoming the major
language in optical sciences. With the appearance of squeezed states as two-photon coherent
states [19], the Lorentz group was recognized as the theoretical backbone of coherent states
as well as generalized coherent states.

In their recent paper [2], Han et al. studied in detail possible optical devices which
produce the shear matrices of Eq. (1). This effect is due to the mathematical identity called
"Iwasawa decomposition" [20,21], and this mathematical technique is relatively new in
optics. The shear matrices of Eq. (1) are products of Iwasawa decompositions. Since we are
using those matrices to produce the most general form of ABCD, we are performing inverse
processes of the Iwasawa decomposition.

It should be noted that the decomposition we used in this paper has a specific purpose.
If purposes are different, different forms of decomposition may be employed. For instance,
decomposition of the ABCD matrix into shear, squeeze, and rotation matrices could serve
useful purposes for canonical operator representations [13,22]. The amount of calculation
seems to depend on the choice of decomposition.

Group theory in the past was understood as an abstract mathematics. In this paper, we 
have seen that it can be used as a calculational tool. We have also noted that there is a 
place in computer science for group theoretical tools. 
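APPENDIX A: BARGMANN DECOMPOSITION

In his 1947 paper [9], Bargmann considered

\[
W = \begin{pmatrix} \alpha & \beta \\ \beta^* & \alpha^* \end{pmatrix}, \tag{A1}
\]

with $\alpha\alpha^* - \beta\beta^* = 1$. There are three independent parameters. Bargmann then
observed that $\alpha$ and $\beta$ can be written as

\[
\alpha = (\cosh\eta)\, e^{-i(\phi + \lambda)}, \qquad
\beta = (\sinh\eta)\, e^{-i(\phi - \lambda)}. \tag{A2}
\]

Then W can be decomposed into

\[
W = \begin{pmatrix} e^{-i\phi} & 0 \\ 0 & e^{i\phi} \end{pmatrix}
\begin{pmatrix} \cosh\eta & \sinh\eta \\ \sinh\eta & \cosh\eta \end{pmatrix}
\begin{pmatrix} e^{-i\lambda} & 0 \\ 0 & e^{i\lambda} \end{pmatrix}. \tag{A3}
\]

In order to transform the above expression into the decomposition of Eq. (12), we take the
conjugate of each of the matrices with

\[
C_1 = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & i \\ i & 1 \end{pmatrix}. \tag{A4}
\]

Then $C_1 W C_1^{-1}$ leads to

\[
\begin{pmatrix} \cos\phi & -\sin\phi \\ \sin\phi & \cos\phi \end{pmatrix}
\begin{pmatrix} \cosh\eta & \sinh\eta \\ \sinh\eta & \cosh\eta \end{pmatrix}
\begin{pmatrix} \cos\lambda & -\sin\lambda \\ \sin\lambda & \cos\lambda \end{pmatrix}. \tag{A5}
\]

We can then take another conjugate with

\[
C_2 = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & 1 \\ -1 & 1 \end{pmatrix}. \tag{A6}
\]

Then the conjugate $C_2 C_1 W C_1^{-1} C_2^{-1}$ becomes

\[
\begin{pmatrix} \cos\phi & -\sin\phi \\ \sin\phi & \cos\phi \end{pmatrix}
\begin{pmatrix} e^{\eta} & 0 \\ 0 & e^{-\eta} \end{pmatrix}
\begin{pmatrix} \cos\lambda & -\sin\lambda \\ \sin\lambda & \cos\lambda \end{pmatrix}. \tag{A7}
\]

This expression is the same as the decomposition given in Eq. (12).
The combined effect of $C_2 C_1$ is

\[
C_2 C_1 = \frac{1}{\sqrt{2}}
\begin{pmatrix} e^{i\pi/4} & e^{i\pi/4} \\ -e^{-i\pi/4} & e^{-i\pi/4} \end{pmatrix}. \tag{A8}
\]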

If we take the conjugate of the matrix W of Eq. (A1) using the above matrix, the elements
of the ABCD matrix become

\[
\begin{aligned}
A &= \tfrac{1}{2}(\alpha + \alpha^* + \beta + \beta^*), \\
B &= -\tfrac{i}{2}(\alpha - \alpha^* - \beta + \beta^*), \\
C &= \tfrac{i}{2}(\alpha - \alpha^* + \beta - \beta^*), \\
D &= \tfrac{1}{2}(\alpha + \alpha^* - \beta - \beta^*).
\end{aligned} \tag{A9}
\]

It is clear from this expression that all the elements in the ABCD matrix are real numbers.
Indeed, the representation $\alpha\beta$ is equivalent to the ABCD representation. In terms of the
parameters $\lambda$, $\eta$ and $\phi$,

\[
\begin{aligned}
A &= (\cosh\eta)\cos(\phi + \lambda) + (\sinh\eta)\cos(\phi - \lambda), \\
B &= -(\cosh\eta)\sin(\phi + \lambda) + (\sinh\eta)\sin(\phi - \lambda), \\
C &= (\cosh\eta)\sin(\phi + \lambda) + (\sinh\eta)\sin(\phi - \lambda), \\
D &= (\cosh\eta)\cos(\phi + \lambda) - (\sinh\eta)\cos(\phi - \lambda).
\end{aligned} \tag{A10}
\]
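
The chain of conjugations can be checked numerically. The sketch below assumes the conjugating matrices C1 = (1/sqrt 2) [[1, i], [i, 1]] and C2 = (1/sqrt 2) [[1, 1], [-1, 1]], the pair consistent with the combined matrix of Eq. (A8); the parameter values are arbitrary.

```python
import numpy as np

def rot(angle):
    c, s = np.cos(angle), np.sin(angle)
    return np.array([[c, -s], [s, c]])

phi, eta, lam = 0.4, 0.8, -1.1

# Bargmann's W with alpha and beta parametrized by phi, eta, lam.
alpha = np.cosh(eta) * np.exp(-1j * (phi + lam))
beta = np.sinh(eta) * np.exp(-1j * (phi - lam))
W = np.array([[alpha, beta], [np.conj(beta), np.conj(alpha)]])

# Conjugating matrices (assumed forms, see the lead-in).
C1 = np.array([[1, 1j], [1j, 1]]) / np.sqrt(2)
C2 = np.array([[1, 1], [-1, 1]]) / np.sqrt(2)
C = C2 @ C1

G = C @ W @ np.linalg.inv(C)
expected = rot(phi) @ np.diag([np.exp(eta), np.exp(-eta)]) @ rot(lam)

print(np.max(np.abs(G.imag)))        # imaginary parts vanish up to rounding
print(np.allclose(G.real, expected))
```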






REFERENCES 



[1] D. Han, Y. S. Kim, and M. E. Noz, J. Opt. Soc. Am. A 14, 2290 (1997); D. Han, Y. S.
Kim, and M. E. Noz, Phys. Rev. E 56, 6065 (1997).
[2] D. Han, Y. S. Kim, and M. E. Noz, Phys. Rev. E 60, 1036 (1999).
[3] Y. S. Kim and E. P. Wigner, Am. J. Phys. 58, 439 (1990).
[4] Y. S. Kim and M. E. Noz, Phase Space Picture of Quantum Mechanics (World Scientific,
Singapore, 1991).
[5] J. J. Monzon and L. L. Sanchez-Soto, Phys. Lett. A 262, 18 (1999).
[6] H. Kogelnik and T. Li, Applied Optics 5, 1550 (1966), and the references listed in this
review paper.
[7] E. C. G. Sudarshan, N. Mukunda, and R. Simon, Optica Acta 32, 855 (1985).
[8] D. Han, Y. S. Kim, and M. E. Noz, Am. J. Phys. 67, 61 (1999).
[9] V. Bargmann, Ann. Math. 48, 568 (1947).
[10] A. W. Lohmann, J. Opt. Soc. Am. A 10, 2181 (1993).
[11] D. Onciul, Optik 96, 20 (1994).
[12] A. Gerrard and J. M. Burch, Introduction to Matrix Methods in Optics (John Wiley &
Sons, New York, 1975).
[13] R. Simon and K. B. Wolf, J. Opt. Soc. Am. A 17, 342 (2000).
[14] W. K. Kahn, Applied Optics 4, 758 (1965).
[15] H. Kogelnik, Applied Optics 4, 1562 (1965).
[16] M. Nakazawa, J. H. Kubota, A. Sahara, and K. Tamura, IEEE Journal of Quantum
Electronics 34, 1075 (1998).
[17] D. Han, Y. S. Kim, and M. E. Noz, Phys. Lett. A 219, 26 (1996).
[18] D. Han, Y. S. Kim, and M. E. Noz, Phys. Rev. E 61, 5907 (2000).
[19] H. P. Yuen, Phys. Rev. A 13, 2226 (1976).
[20] K. Iwasawa, Ann. Math. 50, 507 (1949); R. Hermann, Lie Groups for Physicists (W.
A. Benjamin, New York, 1966).
[21] R. Simon and N. Mukunda, J. Opt. Soc. Am. A 15, 2146 (1998).
[22] M. Nazarathy and J. Shamir, J. Opt. Soc. Am. 72, 356 (1982); H. Sasaki, K. Shinozaki,
and T. Kamijoh, Opt. Eng. 35, 2240 (1996).






